🐿️ Scour
📉 Embeddings Optimization

Embedding Quantization, Dimensionality Reduction, HNSW Indexing, Latent Space Transformations, Embedding Distillation

Learning ON Large Datasets Using Bit-String Trees
arxiv.org·23h
🔢BitNet
Drama Model Inference Efficiency Boosted by 1.7x-2.3x
pytorch.org·4h·Discuss: Hacker News
🧠LLM Inference
An Efficient Dual-Line Decoder Network with Multi-Scale Convolutional Attention for Multi-organ Segmentation
arxiv.org·23h
🧠LLM Inference
TRIM: Accelerating High-Dimensional Vector Similarity Search with Enhanced Triangle-Inequality-Based Pruning
arxiv.org·23h
🗂️Vector Indexes
Effective Clustering for Large Multi-Relational Graphs
arxiv.org·23h
📊Vector Clustering
Made an HF downloader app
github.com·4h·Discuss: r/LocalLLaMA
🗜️Zstd
SSFO: Self-Supervised Faithfulness Optimization for Retrieval-Augmented Generation
arxiv.org·23h
🔍Information Retrieval
Disentangling Polysemantic Neurons with a Null-Calibrated Polysemanticity Index and Causal Patch Interventions
arxiv.org·23h
🔍AI Interpretability
Introduction to Artificial Neural Networks – Part 1 (2013)
theprojectspot.com·37m·Discuss: Hacker News
📊Vector Databases
MMTok: Multimodal Coverage Maximization for Efficient Inference of VLMs
arxiv.org·23h
🧠LLM Inference
CE-RS-SBCIT A Novel Channel Enhanced Hybrid CNN Transformer with Residual, Spatial, and Boundary-Aware Learning for Brain Tumor MRI Analysis
arxiv.org·23h
📊Embeddings
An experimental approach: The graph of graphs
arxiv.org·23h
🔍Vector Search Algorithms
FedKLPR: Personalized Federated Learning for Person Re-Identification with Adaptive Pruning
arxiv.org·23h
🎛️Feed Filtering
Reconciling Communication Compression and Byzantine-Robustness in Distributed Learning
arxiv.org·23h
🧮LSH
OmniMRI: A Unified Vision-Language Foundation Model for Generalist MRI Interpretation
arxiv.org·23h
🔍AI Interpretability
Paradigms of Intelligence Team
github.com·14h·Discuss: Hacker News
🛡️AI Safety
DiCache: Let Diffusion Model Determine Its Own Cache
arxiv.org·23h
💾Prompt Caching
A Laplace diffusion-based transformer model for heart rate forecasting within daily activity context
arxiv.org·23h
🧠LLM Inference
M^3-GloDets: Multi-Region and Multi-Scale Analysis of Fine-Grained Diseased Glomerular Detection
arxiv.org·23h
📊Vector Databases
Development of an isotropic segmentation model for medial temporal lobe subregions on anisotropic MRI atlas using implicit neural representation
arxiv.org·23h
📊Embeddings